Implementation of Delta Compression

نویسنده

  • Chunpeng Xiao
چکیده

A Matlab simulation is carried out to verify the compression ratio analysis. Packet Xk and V are generated as two i.i.d. random sequences that follow a discrete uniform distribution between 0 and 255 with a packet length of 1,500 bytes. Packet Xk+1 is generated according to the simplified content generation model. Considering Xk and Xk+1 as two byte strings, our lossless delta compression algorithm is designed as follows. Between Xk+1 and Xk, Xk+1 is regarded as a string with many substrings copied from Xk and new bytes. Each substring is coded by its length and location in Xk. No assumptions about the format or alignment of the input data are made. The flowchart for our compression scheme is depicted in Figure 1. Essentially, this can be broken down into two steps: decomposing Xk+1 into substrings copied from Xk; coding the length and location of each substring i (leni, positioni) by a number of bits. In Step 1, obtaining the optimal partition for Xk+1 is a challenging topic. Several methods have been proposed to increase the searching speed and efficiency. For example, a hash function can be used for searching the longest matched substring. However, for short packets (say less than 1,500 bytes) or non-real time compression, hash tables are not necessary. One can simply search the current byte Xk+1(i) in Xk directly and look for the longest match. In Step 2, because leni cannot be guaranteed to be larger than 1 (otherwise a matched substring cannot be found), a modified substring coding scheme is used by adding an index bit: a single byte is coded by 9 bits (L1 = 1.125 bytes) with 1 bit for index and 8 bits for data content; a matched substring is coded by 14 bits (L2 = 1.75 bytes) with 1 index bit, 8 bits for length, and 5 bits for location. In Figure 2, the compression ratio for our lossless delta compression algorithm as well as the entropy curves are plotted for different p, which represents the degree of similarity between Xk+1 and Xk. From Figure 4, it is clear that the delta compression scheme can only be applied for larger values of p between 0.45 and 1 (i.e., 55% of the time). Intuitively, this makes sense since larger compression gains are obtained when there is large similarity between adjacent packets. Additional coding methods (e.g., Huffman coding) can be used to reduce these overheads associated with fixed length coding.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Implementation of VlSI Based Image Compression Approach on Reconfigurable Computing System - A Survey

Image data require huge amounts of disk space and large bandwidths for transmission. Hence, imagecompression is necessary to reduce the amount of data required to represent a digital image. Thereforean efficient technique for image compression is highly pushed to demand. Although, lots of compressiontechniques are available, but the technique which is faster, memory efficient and simple, surely...

متن کامل

Decrease in Hardware Consumption and Quantization Noise of Digital Delta-Sigma Modulators and Implementation by VHDL

A new structure is presented for digital delta-sigma modulator (DDSM). Novel architecture decreases hardware consumption, output quantization noise and spurs in Comparison to previous architectures. In order to reduce the delay, power consumption and increase maximum working frequency, the pipelining technique and the carry skip adder are used. Simulation proposed architecture shows that the qu...

متن کامل

Reducing Solid-State Storage Device Write Stress through Opportunistic In-place Delta Compression

Inside modern SSDs, a small portion of MLC/TLC NAND flash memory blocks operate in SLC-mode to serve as write buffer/cache and/or store hot data. These SLC-mode blocks absorb a large percentage of write operations. To balance memory wear-out, such MLC/TLC-to-SLC configuration rotates among all the memory blocks inside SSDs. This paper presents a simple yet effective design approach to reduce wr...

متن کامل

Migratory compression: coarse-grained data reordering to improve compressibility

We propose Migratory Compression (MC), a coarsegrained data transformation, to improve the effectiveness of traditional compressors in modern storage systems. In MC, similar data chunks are re-located together, to improve compression factors. After decompression, migrated chunks return to their previous locations. We evaluate the compression effectiveness and overhead of MC, explore reorganizat...

متن کامل

Delta Compressed and Deduplicated Storage Using Stream-Informed Locality

For backup storage, increasing compression allows users to protect more data without increasing their costs or storage footprint. Though removing duplicate regions (deduplication) and traditional compression have become widespread, further compression is attainable. We demonstrate how to efficiently add delta compression to deduplicated storage to compress similar (nonduplicate) regions. A chal...

متن کامل

Delta Compression Algorithms for Diverse Environments (CIKM2006)

Delta compression is the process of comparing two files to produce a set of instructions that will convert one file into the other. Storing or transmitting a delta file rather than the entire new file can offer significant efficiency gains. However, the different aspects of delta compression efficiency, like many problems in computer science, can rest at different ends of a balance. There does ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004